Developing a syntactic analyser for Estonian
Abstract
This article gives an overview of the current state of syntactic analysis of Estonian and describes the problems encountered in writing syntactic rules for the Estonian syntactic analyser. So far, only linguistically motivated rules have been used. The article focuses on statistical methods in syntactic analysis and describes experiments with corpus-based patterns in syntactic disambiguation.
Similar resources
An Estonian Morphological Analyser and the Impact of a Corpus on Its Development
The paper describes a morphological analyser for Estonian and how the use of a text corpus influenced both its development and the resulting program. The influence is not limited to the lexicon; it is also noticeable in the resulting algorithm and implementation. When work on the analyser started, there was no computational treatment of Estonian derivatives and compounds. After s...
Parsing Estonian with Constraint Grammar
This paper describes the current state of syntactic analysis of Estonian using Constraint Grammar, focusing mainly on the determination of syntactic functions. Constraint Grammar of Estonian was written in 1996-2000 at the University of Tartu. The author has developed its syntactic part.
Determination of Syntactic Functions in Estonian Constraint Grammar
This article describes the current state of syntactic analysis of Estonian using Constraint Grammar. The Constraint Grammar framework divides parsing into two modules: morphological disambiguation and determination of syntactic functions. This article focuses on the latter module in detail. If the morphological disambiguator achieves a precision of more than 85% and error rate is smaller tha...
Finite-state Relations Between Two Historically Closely Related Languages
Regular correspondences between historically related languages can be modelled using finite-state transducers (FST). A new method is presented by demonstrating it with a bidirectional experiment between Finnish and Estonian. An artificial representation (resembling a protolanguage) is established between two related languages. This representation, AFE (Aligned Finnish-Estonian) is based on the l...
Shallow Parsing of Spoken Estonian Using Constraint Grammar
In this paper we describe how we have adapted the syntactic analyzer of written Estonian to the spoken language. The Constraint Grammar shallow syntactic parser (Müürisep et al. 2003) was used for the automatic syntactic analysis of the corpus of Estonian spoken language (Hennoste et al. 2000). To adapt the parser, the clause boundary detection rules as well as some syntactic constraints had to...